Text to speech conversion of text messages from mobile communication devices

ABSTRACT

A method includes providing a user interface, at a mobile communication device, that includes a first area to receive text input and a second area to receive an identifier associated with an addressee device. The text input and the identifier are received via the user interface. A short message service (SMS) message including the text input is transmitted to a Text to Speech (TTS) server for conversion into an audio message and for transmission of the audio message to the addressee device associated with the identifier. An acknowledge message transmitted from the TTS server permits the addressee device to allow delivery of the audio message or to decline delivery of the audio message. The TTS server transmits the audio message in response to the addressee device allowing delivery of the audio message. A confirmation message is received from the TTS server that indicates that a reply voice message has been received from the addressee device in response to the audio message.

RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.12/628,693, filed Dec. 1, 2009, which is a continuation patentapplication of and claims priority from U.S. patent application Ser. No.11/858,775, now U.S. Pat. No. 7,649,877, entitled “Mobile Device forSending Text Messages”, filed on Sep. 20, 2007, which claims priorityfrom U.S. patent application Ser. No. 10/491,797, now U.S. Pat. No.7,310,329, entitled “System for Sending Text Messages Converted intoSpeech Through an Internet Connection to a Telephone and Method forRunning It”, filed on Apr. 6, 2004, which claims priority from PCTapplication PCT/IT02/00673 filed on Oct. 21, 2002 and from ItalianPatent Application No. FI0A000199 filed on Oct. 22, 2001, all of whichare incorporated herein by reference in their entirety.

BACKGROUND

Different systems use the Internet as a means of transferring data, butsuch systems are typically not capable of sending text messages from theInternet and converting them into vocal messages on a telephone line.

By means of an IP connection it is possible to download songs, totelephone, to send e-mail messages, and to send SMS messages.

The most common peripheral unit, available for everyone, to be connectedto the Internet is the modem. This word is a mixture of two words,modulator and demodulator. The function of a modem is to convert digitaldata in the form of electric pulses into a form that can be used fortransmission on normal telephone lines. Therefore, the modem convertselectric signals coming from the computer into sounds and permits theirtransmission by means of the telephone line. The receiving modem makesthe opposite operation: as soon as it receives the sounds from thetelephone line, it reconverts them into electric signals in order tomake them available to the computer it is connected to. This furtheroperation is called demodulation. Technical evolutions of modems includenew peripheral units and digital platforms that send data on a suitablyshaped line without any need to convert them into sounds, such as ISDNdata transmitting cards. Another resource available to users is the DSLconnection. Using this system it is possible to divide the vocaltransmission from the data transmission by using particular filters.

Also referring to Internet connections, another available technology forusers is the system of connection by satellites. Thanks to these systemseven people who are not experts at hardware and software can surf on theInternet, send e-mail messages, send SMS messages, and send and receivefax and other related operations. By means of data transmitting cards,permitting the transmission of complex data in audio and even videoforms, also called technological platforms, many companies develop moreand more complex programs and applications in order to provide firmswith ancillary services.

One of the easiest ways to send communications with a telephone, usingthe Internet, is writing SMS messages. For this purpose, the user fillsin a form on the web writing a text message and sends it to aserver/gateway that transfers it to the mobile number the message isdirected to.

However, this means of communication has some drawbacks. SMS messagesare short and the communication reaches almost exclusively the users ofthe country where the portal works. In addition, these SMS messagesreach neither cordless phones not using the GSM system nor common housetelephones. The communication that reaches the final destination ismainly textual, in the form of a message that can be read on a display,not vocal. Another important characteristic is that users cannot verifyimmediately the result of the SMS message sent on-line.

“Text-to-speech” (TTS) technology exists, which permits someinteractions via internet/telephone. By these means, anInternet/telephone service where TTS is in use allows a servicesubscriber to receive a notification that a message has arrived on thephone or a notification to dial a telephone number and listen to themessage in the form of speech. Nevertheless, this system is notconvenient, as it needs registrations, and is limited and not easy to beused by an inexpert user. The TTS technology principally reads text andconverts the text to speech. TTS technology has been useful forteaching, utility and support purposes, in particular for those who aredisabled and blind, but as regards to its use with telephoneinteractions, it is mostly based on “inbound” applications, i.e.,centered in order to support inbound and not outbound communications.Vocal tree menus are an example and are used by companies in order toautomatically provide a caller with information.

Other systems permitting communication by voice with another user viathe Internet also exist. But with the traditional methods, peopleholding a single telephone line cannot communicate by voice in a simpleand comfortable way through the computer with another telephone. Infact, in order to make a call using the Internet, the user needssoftware, loudspeakers and microphones. An Internet user is not able tosend text messages to the designated telephones, except for the messagesappearing on the display of the phone.

Also, e-mails, the most widespread means of communication via the web,cannot give guaranties about the correct receipt of the communication bythe addressee. As a matter of fact, a simple automatic acknowledgetransmission of the e-mail by the addressee to the sender is notsufficient to say he/she has entirely read the communication orunderstood its meaning.

BRIEF DESCRIPTION OF THE DRAWINGS

The present disclosure may be better understood, and its numerousfeatures and advantages made apparent to those skilled in the art byreferencing the accompanying drawings.

FIG. 1 is a general diagram that illustrates elements of a communicationsystem.

FIG. 2 is a general diagram that illustrates operation of the system ofFIG. 1.

FIG. 3 is a general diagram that illustrates operation of the system ofFIG. 1.

FIG. 4A illustrates a display that may be used to send an emergencymessage.

FIG. 4B illustrates a display that may be used to send a job relatedmessage.

FIG. 4C illustrates a display that may be used to send a message to anew friend.

FIG. 4D illustrates a display that may be used to communicate.

FIG. 5 is a general flow diagram to illustrate a method ofcommunicating.

DETAILED DESCRIPTION

FIG. 1 shows a system that includes hardware (1), used by the user forwriting messages and sending them; a connection to Internet (2), by amodem (6) or ISDN connecting card, net card or alternative systemsconnecting to Internet; a server (3) where a software is installed,directing the various functions and changes text messages into speech,sending them by means of voice modem and/or technological platforms fortransmitting vocal data (4) by a telephone line (5) to a telephone (7)of the addressee of the communication. The addressee, if he wishes, caninteract with the vocal message received by using a keyboard or a keypadassociated with the telephone (7) or by speaking into the telephone.This possible interaction will be sent (8) on the server and filed forthe sender who can then read and/or listen to it.

The text written by a user, by an IP connection, gets to a server, whereit is converted into speech and sent to a telephone number by atelephone line. The entire executing process is then saved on a filethat can be read by means of an IP connection. This system involveshardware, data transmission systems, telephone lines, specializedsoftware, receiving apparatus and many technical details.

Referring to FIG. 2, a flowchart shows the steps of the system in thecase that the user uses a form found on the Internet. On a web page, theuser can find a form to write text. This form may contain severaldetails. The user writes a message (B) and presses enter to send themessage to the server (C). The server receives the message, encodes itusing software (D), such as a “voice engine”, and changes the text intospeech using text-to-speech (TTS) technology. The telephone number ofthe addressee is received by the software that is configured to send thecommunication by means of a peripheral, such as a voice modem and/or atechnological platform (E). The peripheral sends the message to theaddressee by a normal telephone line (F). The addressee receives thecommunication, which was originally sent in the form of text and thenconverted into speech. In addition, the addressee (G) can interact withthe caller, using the keyboard, by the DTMF system or directly via themicrophone of the telephone. So doing, he can answer questions and/orsend communications that will be saved in log files available for dataanalysis and decoding.

Referring to FIG. 3, a flowchart briefly shows the action of theInternet user in the case that he has downloaded software to run thesystem. The Internet surfer (A) writes the message (A1) directly into atable of the software installed on his/her hardware, without using aform on-line as in the previous case. The communication starts and getsto the web server (B) that records it. The software (C) changes the textinto speech and, by means of a voice modem (D) and/or other datatransmitting/receiving cards, sends it to the designated number by atelephone line (E). The communication reaches the addressee's telephone(F). The addressee has the possibility of answering with various options(G). For this system, the surfer uses software specifically provided,connects to the Internet, writes a message via the software interface,sends it to a web server, which converts it into speech by means ofsophisticated software, and sends the communication to the addressee. Bymeans of the software, the user can also send pre-recordedcommunications filed at the server.

FIGS. 4A, 4B, 4C, 4D schematically show how the user can write a messageand send it. In particular, they describe some practical examples. Inthese examples, it is possible to send and/or receive the communicationto and/or from an alias as well as to a telephone number. Additionally,it is possible to send/or receive the communication in a desiredlanguage. In particular, FIG. 4A shows the system used for socialutility or emergency cases; FIG. 4B shows the system used to look for ajob; FIG. 4C shows the system used to look for new friends; FIG. 4Dshows the system for everyday use. These illustrated examples are simplypractical examples but are not to be considered restrictive.

FIG. 5 is a flowchart that describes a system showing how thetext-to-speech (TTS) technology works in a particular case. Using TTStechnology, words and text are fragmented into phonemes and are analyzedby characters, figures, sums, inflections and punctuation. The finalresult is an audio form.

The system and method of the present disclosure allows Internet surfers,or more generally users that have a modem and/or even a more developedanalog or digital card transmitting and receiving data, to be able towrite a text on a form found on the web site and/or by means of aninterface downloaded from software, to send this message to a telephonenumber and make it known to the addressee in real time and/or bypre-recording. This system also constitutes a valid alternative tocommunicate vocally with another person with a single telephone linesimply by writing a text, with no need of loudspeakers and microphonesor subscriptions to services permitting calling via the Internet.

This disclosure aims at transmitting on a common telephone vocalmessages obtained by the conversion of text messages written by the userand sent via the Internet.

In a particular embodiment, a system of the present disclosure involvesuse of hardware and software devices, telematic instruments and systemsfor the transmission of data, which together allow the receipt of avocal message sent originally by the Internet in the form of text.

In a particular embodiment, the system for sending text messagesconverted into speech comprises: means to write and send messages, by adata input device including hardware showing a table where users canwrite a text message and/or order the transmission of a pre-recordedmessage, indicate the telephone number(s) of the addressee or a databasewhere addressees' telephone numbers can be found, and send it to aserver; means to connect the data input device with the server,including a connection to the Internet by modem, data transmitting andreceiving cards, similar apparatus or kits for satellite connection;means to receive and process the text message, by the server, where aprogram is installed that is able to convert the text message into avocal message and set it up for transmission on the phone, andcomprising other applications capable of controlling and filing the datarelative to the entire telephone transaction; means to transmit thevocal message on the telephone line by a voice modem or othertechnological peripherals for the transmission of at least vocal data;and means to receive the message, including a common phone.

Conveniently, it is possible to reply to the vocal messages received bymeans of a suitable keyboard connected to the phone using DTMF keys andtones. Conveniently, the addressee can send a reply vocal message to thesender, so that it is then filed on the server.

Conveniently, on the computer through which data is input, software isinstalled, which permits the transmission of text messages by modem, orotherwise. The computer can also include digital and/or analog cards fortransmitting and receiving pre-recorded communications. The softwareprogram installed by the user may be represented by an icon within thebrowser, in case the user is surfing on the Internet or is not using itat that particular moment.

Conveniently, on the computer through which data is input, there is atable, either created by installed software or an HTML form or similarform, which comprises a suitable interface used to write a message andsend the message to the server carrying out the transaction until themessage reaches its destination. By this interface, the user can writemany messages to various people and send them by means of an automaticconnection to the Internet.

The table is also used to send pre-recorded communications loaded on anexternal server to various addressees and at specific hours.

Conveniently, the table through which messages are written includes anarea for an optional input of the sender's telephone number or his/her“alias” or nickname.

Conveniently, the table through which messages are written includes anarea for the necessary input of the addressee's telephone number orhis/her “alias” or nickname.

Conveniently, the table through which messages are written includes anarea for the input of the language the user wants to use to send themessage to the designated telephone number. Conveniently, the tablethrough which messages are written includes an area for the selection ofthe pre-recorded message to send.

The system permits the transmission from the web of both text andpre-recorded messages in different forms, thanks to a system forcompressing and converting data. Nowadays, the transmission ofpre-recorded messages for telemarketing campaigns and the conversion ofthe text into speech have some technical limits. Particularly recordedmessages converted into files often take such wide dimensions that theyare not easy to send to the addressee. In addition, the “voice engines”,which are the engines that change the text into speech, cannot be easilyused, as they need a computer background on operating systems and thesmall amount of information available cannot be easily understood by allusers.

Conveniently, the audio messages registered in professional forms, aswell as the text files for the conversion in audio using“text-to-speech” technology, are converted into smaller files and sentto the addressee simply using voice modems and not necessarilysophisticated data transmitting cards. As a matter of fact, files thatare too large would not allow the telephone message to be transmittedimmediately. The user on the phone could wait several minutes beforereceiving the communication and could hang up. The larger the file tosend is, the longer the wait may be. The system of the presentdisclosure avoids this drawback thanks to particular technicalsolutions: first, inserting an initial immediate “beep”, starting whenthe receiver is connected, which allows the addressee to hear a signaland hold it waiting, in case he needs less time to listen to thecommunication; second, converting the forms into automatically smallerfiles, thanks to a particular system for the compression of audio files,which makes them in an average mono quality, sufficient for thereception level of the phone; and third, is the possibility to insert atthe beginning of the call a “confirmation file”, i.e., an acknowledgevocal message, which causes the user to interact with the telephonekeyboard for listening or declining the arriving communication.

Conveniently, the text communication can be addressed to an “alias” andnot to a particular telephone number. The server will be able toidentify the alias and connect the communication to a specific telephonenumber, providing the privacy of the users.

This system also allows all users, under authorization, to use the mostapplications and functions available, thanks to the database on the webserver. An Internet user, with his/her password, will be able to sendvarious communications to various addressees, simply by indicating therequired preferences.

Conveniently, the server holds a file archive with a database andmultiple support that allows authorized users to import files withtelephone numbers, write a text or retrieve pre-recorded messages,record them in a different audio form, send them to the addressees, andfile everything on the server database, all using a simple connection tothe Internet, without particular programs or the help of experts in thisfield.

Conveniently, the users will be able to write simple messages and sendthem with synthesized voices and possibly real voices and with an image,chosen among figures at pleasure, like robots, actors, singers, footballplayers and other. With a simple photo and a nickname of the addressee,it is possible to send him/her a communication on the phone withoutknowing his/her number, and also relations between foreign companiesmight develop thanks to the possibility of a translator that convertsthe text into the desired language and sends it to the addressee. Anyonethat has difficulties in speaking will have a new means ofcommunication. People can talk together saving time and money, becauseoperational and telephone costs will be covered by sponsors andadvertising. New jobs related to services, information and publicutility will be created.

Conveniently, the table through which messages are written includes anarea for the insertion of the time and date to send the message.

Conveniently, the table through which messages are written includes anarea for the selection of the tone of voice to use to send the message.Conveniently, the table through which messages are written includes anarea for the selection of the transmission and/or reception of themessages to several telephone numbers.

Conveniently, the table through which messages are written includes anarea for the selection of an automatic repeat of the number, in case thenumber called is engaged and/or free but without answer, choosing howmany calls to try.

Conveniently, the table through which messages are written includes anarea permitting to decide that in case there is an answering machineand/or a call transfer, the message can be transmitted anyway.

Conveniently, the user can send and/or receive the communicationanywhere and to any telephone number. Conveniently, this system allowsthe user to receive an acknowledgement of the receiving message bye-mail or a message on the phone.

Conveniently, this system permits monitoring of every call made by theuser, and the relative results are filed, so that the user can laterverify and check the history of the message. Conveniently, the user hasa password in order to send and/or receive the communication and isauthorized to send and/or receive the communication as the serverautomatically detects it.

Conveniently, the user is authorized by an identification system to usethe database and the functions of the system found on the web serverand, thanks to the program installed on the server, can import data ontohis hardware, input them on the server database, make operations inorder to send messages to several addressees, select pre-recordedmessages, select options, times and dates, and file and save data on theserver.

Conveniently, the data transmission for the outbound calls is notperformed via cable, but through an analog or digital telephone line.

Conveniently, this system permits a user to send and/or receivecommunications in real time and/or pre-recorded messages filed on theserver in such a form suitable to be received by the addressee.

With regards to the reception, the addressee of the message can interactthrough telephone, using the keyboard or answering the message by voice.The server registers the answers and/or interactions left by theaddressee, so that the sender, by means of a specific interface, canconnect to the server, decode the meaning of the DTMF tones and/orlisten to the vocal messages left by the addressee.

Conveniently, writing a text inside a banner and/or form, the user cansend and/or receive a pre-recorded communication with the initial and/orfinal addition of the written text converted into speech.

Conveniently, an acoustic signal is transmitted as soon as there is atelephone connection with the addressee, avoiding immediatedisconnection in case of a brief waiting period before the transmissionof the communication.

Conveniently, if the transmitting file is too large, it is automaticallyconverted and reduced to such an extent that the addressee can correctlyreceive it.

A method to convert text messages into vocal messages and send them bytelephone line is disclosed and is characterized in that it comprisesthe following steps: decoding of the text received by the user;conversion of the text into speech; extension of the size of the vocalfile created; adjustment of the vocal message in such a form to be readby the data transmitting card; interaction with the transmitting card;dialing of the number(s) indicated; transmission of the message on thephone; recording of the data of the telephone call and filing of thedata on a server database; and during conversion, compress the file ifwider and not suitable for a perfect transmission.

This method creates more audio files that are arranged by number andsent, following the order of configuration, all together in the sametelephone call, to allow the addressee to listen on the phone to severalvocal messages arranged in order but included in a single call.

Conveniently, the first stage of conversion of words is called“standardization of the text”, the second stage is the “standardizationof the pronunciation”, the third stage is the pronunciation itself thatputs in order more phonemes. Specific algorithms are used in order tofragment words and find the sound produced by a specific letter. Thelast stage of this process is the “prosody”, which is the speed ofreading, with possible pauses and other details for the perfection ofthe audio. At this stage, the voice modem and/or a technologicalplatform converts the text to voice.

The possibility to send long and wide text messages, later convertedinto vocal messages, to telephone numbers all around the world, permitsa user to send easily and rapidly any kind of communication, no matterthe length of the text. This operation can reduce costs and gives thesender a better assurance than previous systems, permitting a completeconfirmation of the receipt by the addressee.

This acknowledgment is much more complete using the system of thepresent disclosure, which permits a user to check the complete orpartial receipt of the communication, registering the opinions andinteractions of the addressee. The system of the present disclosure hasother advantages: for example, it allows people who have problems withcommunication, e.g., autistic, deaf and dumb, to communicate vocally onthe phone with anyone without using their own voice, sending messages,help calls and other services by phone, so that the written text arrivesto the addressee in the form of speech. Another application of thisdisclosure is to translate the text in any language by means of asimultaneous translator that changes the written text into the desiredlanguage and sends it to the addressee in the form of speech.

The illustrations of the embodiments described herein are intended toprovide a general understanding of the structure of the variousembodiments. The illustrations are not intended to serve as a completedescription of all of the elements and features of apparatus and systemsthat utilize the structures or methods described herein. Many otherembodiments may be apparent to those of skill in the art upon reviewingthe disclosure. Other embodiments may be utilized and derived from thedisclosure, such that structural and logical substitutions and changesmay be made without departing from the scope of the disclosure.Additionally, the illustrations are merely representational and may notbe drawn to scale. Certain proportions within the illustrations may beexaggerated, while other proportions may be reduced. Although specificembodiments have been illustrated and described herein, it should beappreciated that any subsequent arrangement designed to achieve the sameor similar purpose may be substituted for the specific embodimentsshown. This disclosure is intended to cover any and all subsequentadaptations or variations of various embodiments. Combinations of theabove embodiments, and other embodiments not specifically describedherein, will be apparent to those of skill in the art upon reviewing thedescription. Accordingly, the disclosure and the figures are to beregarded as illustrative rather than restrictive.

The Abstract of the Disclosure is submitted with the understanding thatit will not be used to interpret or limit the scope or meaning of theclaims. In addition, in the foregoing Detailed Description, variousfeatures may be grouped together or described in a single embodiment forthe purpose of streamlining the disclosure. This disclosure is not to beinterpreted as reflecting an intention that the claimed embodimentsrequire more features than are expressly recited in each claim. Rather,as the following claims reflect, inventive subject matter may bedirected to less than all of the features of any of the disclosedembodiments. Thus, the following claims are incorporated into theDetailed Description, with each claim standing on its own as definingseparately claimed subject matter.

The above-disclosed subject matter is to be considered illustrative, andnot restrictive, and the appended claims are intended to cover all suchmodifications, enhancements, and other embodiments, which fall withinthe scope of the present invention. Thus, to the maximum extent allowedby law, the scope of the present invention is to be determined by thebroadest permissible interpretation of the following claims and theirequivalents, and shall not be restricted or limited by the foregoingdetailed description.

What is claimed is:
 1. A non-transitory computer-readable storage mediumcomprising instructions that, when executed by the processor, cause theprocessor to: provide a user interface at a mobile communication device;receive a text input and an identifier associated with an addresseedevice via the user interface; transmit a message including the textinput from the mobile communication device to a server for conversioninto an audio message and for transmission of the audio message to theaddressee device associated with the identifier; receive a reply messagefrom the server, wherein the reply message includes a message from theaddressee device in response to the audio message; provide a repeattransmission option via the user interface in response to the addresseedevice not accepting delivery of the audio message or declining deliveryof the audio message; and automatically attempt one or more additionaltransmissions of the audio message to the addressee device in responseto a selection of the repeat transmission option.
 2. The non-transitorycomputer-readable storage medium of claim 1, further comprisinginstructions that, when executed by the processor, cause the processorto transmit a request from the mobile communication device to the serverto receive the reply message.
 3. The non-transitory computer-readablestorage medium of claim 2, wherein the reply message is a voice message,the computer-readable storage medium further comprising instructionsthat, when executed by the processor, cause the processor to play thevoice message at the mobile communication device in response to receiptof the voice message from the server.
 4. The non-transitorycomputer-readable storage medium of claim 1, wherein the reply messageis an email message that includes a voice message attached to the emailmessage.
 5. The non-transitory computer-readable storage medium of claim1, wherein the one or more additional transmissions are automaticallyattempted when the addressee device is busy.
 6. The non-transitorycomputer-readable storage medium of claim 1, wherein the user interfacefurther comprises a selectable input to specify a tone of voiceassociated with the audio message.
 7. The non-transitorycomputer-readable storage medium of claim 1, wherein the identifierincludes a telephone number, and wherein the user interface is operableto receive one or more telephone numbers.
 8. The non-transitorycomputer-readable storage medium of claim 1, wherein the user interfacefurther comprises a banner interface area configured to receive aselection of a pre-recorded communication, and wherein the selectedpre-recorded communication is sent with the audio message.
 9. Thenon-transitory computer-readable storage medium of claim 1, wherein theone or more additional transmissions are automatically attempted whenthe addressee device is free but does not answer a call attemptingdelivery of the audio message.
 10. The non-transitory computer-readablestorage medium of claim 1, wherein the user interface provided at themobile communication device includes an affordance whereby a user of themobile communication device can select the repeat transmission option.11. A method comprising: receiving, at a server, a message and anidentifier associated with a destination device from a mobilecommunication device, the message including text input received at themobile communication device; converting the text input into an audiomessage; transmitting the audio message from the server to thedestination device; in response to a determination that the audiomessage was not received by the addressee device, automaticallyattempting one or more additional transmissions of the audio message tothe addressee device; receiving, at the server, a reply message from thedestination device in response to the audio message; and transmittingthe reply message from the server to the mobile communication device inresponse to receiving the reply message.
 12. The method of claim 11,further comprising: receiving a request from the mobile communicationdevice for the reply message; and transmitting the reply message fromthe server to the mobile communication device in response to therequest.
 13. The method of claim 11, wherein the reply message is anemail message that includes a voice message attached to the emailmessage.
 14. The method of claim 11, wherein the message furthercomprises a language identifier specifying a language to which the textinput is to be translated, and wherein the method further comprisestranslating the text input from a first language to a second languagebefore the text input is converted into the audio message.
 15. Themethod of claim 11, wherein the message received at the server furtherincludes information associated with a selection of a pre-recordedcommunication from a plurality of pre-recorded communications presentedat the mobile communication device via a banner interface of a web site,wherein the selected pre-recorded communication is included in the audiomessage transmitted from the server to the destination device.
 16. Themethod of claim 11, further comprising determining that the audiomessage was not received by the addressee device, wherein determiningthat the audio message was not received includes determining that theaddressee device is free but does not answer a call attempting deliveryof the audio message.
 17. The method of claim 11, further comprisingdetermining that the audio message was not received by the addresseedevice because the addressee device was busy.
 18. A method comprising:providing a user interface to a display device of a mobile communicationdevice; receiving a text input and an identifier associated with anaddressee device via the user interface; transmitting a messageincluding the text input from the mobile communication device to aserver for conversion into an audio message and for transmission of theaudio message to the addressee device associated with the identifier;receiving, at the mobile communication device, a reply message from theserver, the reply message indicating that a reply voice message has beenreceived at the server from the addressee device in response to theaudio message; providing a repeat transmission option via the userinterface in response to the addressee device not accepting delivery ofthe audio message or declining delivery of the audio message; andautomatically attempting one or more additional transmissions of theaudio message to the addressee device in response to a selection of therepeat transmission option.
 19. The method of claim 18, wherein thereply message is an email message, and the reply voice message isattached to the email message.
 20. The method of claim 18, furthercomprising transmitting a request from the mobile communication deviceto the server to receive the reply voice message.
 21. The method ofclaim 20, wherein the reply voice message is received at the server in afirst language and wherein the server converts the reply voice messagefrom the first language to a second language before transmission to themobile communication device.
 22. A method for sending an audio messageto an addressee device, comprising: providing a user interface to adisplay device of a mobile communication device, the user interfaceincluding a repeat transmission option that, when selected, causes oneor more additional transmissions of the audio message to be attempted inresponse to a determination that the audio message has not been receivedby the addressee device; receiving a text input and an identifierassociated with the addressee device via the user interface;transmitting a message including the text input from the mobilecommunication device to a server for conversion into the audio messageand for transmission of the audio message to the addressee deviceassociated with the identifier; and receiving, at the mobilecommunication device, a reply message from the server, the reply messageindicating that a reply voice message has been received at the serverfrom the addressee device in response to the audio message.
 23. Themethod of claim 22, further comprising causing the one or moreadditional transmissions of the audio message to be attempted inresponse to the determination that the audio message has not beenreceived by the addressee device.
 24. The method of claim 23, whereinthe determination that the audio message has not been received by theaddressee device includes a determination that the addressee device didnot accept delivery of the audio message.
 25. The method of claim 23,wherein the determination that the audio message has not been receivedby the addressee device includes a determination that the addresseedevice declined delivery of the audio message.